CDS

Accession Number TCMCG010C22016
gbkey CDS
Protein Id XP_016574884.1
Location complement(join(30754526..30755027,30755115..30755159,30756388..30756464,30756560..30756676,30757456..30757550,30758469..30758538,30758703..30758800,30760060..30760110,30763057..30763099,30764208..30764298,30764592..30764917,30766595..30766921))
Gene LOC107872793
GeneID 107872793
Organism Capsicum annuum

Protein

Length 613aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA319678
db_source XM_016719398.1
Definition PREDICTED: uncharacterized protein LOC107872793 isoform X2 [Capsicum annuum]

EGGNOG-MAPPER Annotation

COG_category S
Description Protein of unknown function (DUF668)
KEGG_TC -
KEGG_Module M00406        [VIEW IN KEGG]
M00430        [VIEW IN KEGG]
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko00002        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko03019        [VIEW IN KEGG]
ko03041        [VIEW IN KEGG]
KEGG_ko ko:K12812        [VIEW IN KEGG]
EC 3.6.4.13        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko03013        [VIEW IN KEGG]
ko03015        [VIEW IN KEGG]
ko03040        [VIEW IN KEGG]
ko05164        [VIEW IN KEGG]
map03013        [VIEW IN KEGG]
map03015        [VIEW IN KEGG]
map03040        [VIEW IN KEGG]
map05164        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGGTGGTCTTTGCTCTAGAAGAGCCACCAGTGAGAATACAACAGGTCGTTCCATACCACATGTCAACGGTCACTTCAACTATGGTGCTGGAACAGTTTATCAGTCACATAGATTGCCCACACAGGCAAATAATGATACTATGCGATCTCCATCCGGAGAAAGCACAGAAAAGCAACCAAGTGAACCAGTGTTCTCTTTTCCAGAGATGAATGCAGCTTCCCACGGTCTTGAAATGGATGATATAAATGATGGAATTCCTCGATTGTCTCGGGCATTATCAAATAAAACCAGATCAACAAGGTCAAAGCAGGTTGCCAGGGCAAAGGTTTCAGAAGTGAGTTCACTTTTGGGCAGAGCTGGTACAGTGGGGTTTGGCAAGGCAGTAGATGTGTTGGACACTCTAGGTAGCAGCATGACAAATTTAAACCTTAGTGGTGGCTTTGCATCCAGCATGGCAACCAAGGGAAATAAAATTTCCATTTTGGCATTTGAAGTTGCAAATACAATTGTCAAAGGTGCCAATCTTATGTATTCCCTTTCAAAAGAGAACATTAAGCATTTGAAGGAGGTGGTTCTCCCTTCTGAAGGTGTGCAGTTGTTGATATCAAAAGATATGGATGAGCTCTTCAGAATTGCTGCAGCAGACAAAAGGTTGGAATCAGAATTGACACCTCATAAACAATTAAAAGAAGAAGCTGAGACTGTGATGCTGCACTTAATGACCTTGGTACAGTATACAGCTGAACTATACCATGAATTGCATGCATTGGACAGAATTGAACAAGATTGTCGACGTAAAGCCCAAGAAGAGGATACTTCAAATGCTACTCAGAGAGGAGACGGCCTTGCAATTTTGAGAGCAGAGTTGAAAAGTCAAAAGAAACATGTTAAAAGTCTAAAAAAGAAGTCTCTCTGGTCCAAAATATTGGAAGAGGTGATGGAAAAGCTTGTGGACATTGTCCATTTTCTCCATTTGGAGATCCATGCTGCATTTAGCAGCACCGATGGAGACCGACCAATAAAAACCAACCATCAGAGATTAGGATCTGCTGGTCTTGCATTACATTATGCAAATATCATTACTCAGATTGATACGCTTGTCACCCGATCAGGTTCAGTGCCCCCAAATACAAGAGATGCTTTATACCAAGGGTTGCCACCAAACATCAAGTCAGCTTTGCGATTCAAAGTACAATCGTTCCAGCTCAAGGAAGAGTTAACTGTGCAACAAATCAAGGCTGAAATGGAGAAAACACTGCAGTGGCTTGTTCCCATGGCAACCAATACAACCAAGGCCCACCATGGCTTTGGATGGGTTGGAGAATGGGCAAATACAGGGAAACCTGCTGGCCAGACTGATCTACTCCGAATTGAGACACTCTATCATGCCGATAAGGAGAAAACTGAAGCTTATATTCTTGAATTGGTTTTATGGCTTCATCATCTTGTCACTCAGTCGCGAAGTGCTGCAAATGGTGGAATCAGATCTCCTGTGAAATCTCCATGCTGTTACCCTAATCAAAAGATGAATCAGTTAACACACAAGCCAAGTTCTCCATCTCCCGCATTAACAGTTGAAGACCAAGAAATGCTTCGGGATGTAAGCAAGAGAAAACTGACACCCGGAATAAGCAAGTCTCAAGAATTTGATACTGCAAGAACAAGGTTGAGCAAGTTCCACAGGCTGAGTAAGAGTAGTAACCATACCCCTATACGTGAAACCAGGAAAGACCCCTTTCCCATCAGGAGACCGTCTTCTGTTCCAGTGATTGACTTTGACATTGATCGGCTCAAAGCTTTGGATGTCATTGATAGAGTTGATACAATTCGAGGTGCATAA
Protein:  
MGGLCSRRATSENTTGRSIPHVNGHFNYGAGTVYQSHRLPTQANNDTMRSPSGESTEKQPSEPVFSFPEMNAASHGLEMDDINDGIPRLSRALSNKTRSTRSKQVARAKVSEVSSLLGRAGTVGFGKAVDVLDTLGSSMTNLNLSGGFASSMATKGNKISILAFEVANTIVKGANLMYSLSKENIKHLKEVVLPSEGVQLLISKDMDELFRIAAADKRLESELTPHKQLKEEAETVMLHLMTLVQYTAELYHELHALDRIEQDCRRKAQEEDTSNATQRGDGLAILRAELKSQKKHVKSLKKKSLWSKILEEVMEKLVDIVHFLHLEIHAAFSSTDGDRPIKTNHQRLGSAGLALHYANIITQIDTLVTRSGSVPPNTRDALYQGLPPNIKSALRFKVQSFQLKEELTVQQIKAEMEKTLQWLVPMATNTTKAHHGFGWVGEWANTGKPAGQTDLLRIETLYHADKEKTEAYILELVLWLHHLVTQSRSAANGGIRSPVKSPCCYPNQKMNQLTHKPSSPSPALTVEDQEMLRDVSKRKLTPGISKSQEFDTARTRLSKFHRLSKSSNHTPIRETRKDPFPIRRPSSVPVIDFDIDRLKALDVIDRVDTIRGA